====== Data Sources ======

My model works by trying to "guess" the values of some [[..:parameters:start | parameters]] so that the [[..:model:start | model]] dynamics best fits empirical data of the cumulative number of cases and deaths counted over time.  The scientific and medical communities have made great efforts to [[http://ebola-wiki.com/List_of_Ebola_related_data_sources?ModPagespeed=noscript | share data widely]] and I've found several sources I rely on regularly.  Thanks to these data curators:
  * [[https://en.wikipedia.org/wiki/Ebola_virus_epidemic_in_West_Africa | Wikipedia contributors]],
  * [[https://github.com/cmrivers/ebola/raw/master/country_timeseries.csv | Caitlin Rivers]],
  * [[https://github.com/montanaflynn/ebola-outbreak-data/raw/master/ebola-outbreak-data.csv | Montana Flynn]], and
  * [[https://docs.google.com/spreadsheets/d/1t-CA4FCupjXDC6rM5_aKCd_LnpO0MwPd1PCrNxwiYoM/export?gid=1593937508&format=csv | Jonathan Dugan]].

When I run my model (ie. systematically try to "guess" the parameter values) I first load the empirical data from the above sources and then merge them into a single timeseries for total global cases and deaths.  You can read my [[..:scripts:start | scripts]] to see how I load and merge the data.  I try to remove duplicates found in multiple datasets but they can still occur (for example, if the same data are marked with different datestamps).  It may seem good to have extra data but if they're redundant they can be dangerous: they might give the impression that your data are more reliable than they actually are.  I've tried to design my model and analysis so redundant data don't give me false impressions.

I also estimate a single value of the [[..:parameters:start | parameter]] $\phi$ to anchor it.  Here's the spreadsheet and data:
  * [[http://www.zoology.ubc.ca/~rikblok/ebolamodel/logphiwho.xlsx | logphiwho.xlsx]]